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£Nl ■ Abstract 



We study questions in incidence geometry where the precise position of points is 'blurry' 
(e.g. due to noise, inaccuracy or error). Thus lines are replaced by narrow tubes, and more 
generally affine subspaces are replaced by their small neighborhood. We show that the 
presence of a sufficiently large number of approximately collinear triples in a set of points 
in C d implies that the points are close to a low dimensional affine subspace. This can be 
viewed as a stable variant of the Sylvester-Gallai theorem and its extensions. 

Building on the recently found connection between Sylvester-Gallai type theorems and 
complex Locally Correctable Codes (LCCs), we define the new notion of stable LCCs, in 
which the (local) correction procedure can also handle small perturbations in the euclidean 
metric. We prove that such stable codes with constant query complexity do not exist. No 
impossibility results were known in any such local setting for more than 2 queries. 

1 Introduction 

The Sylvester-Gallai theorem is a statement about configurations of points in M. d in which 
there is a certain structure of collinear triples. 

Theorem 1.1 (Sylvester-Gallai). Suppose v\, . . . ,v n G M d are such that for all i ^ j G [n] 

there is some k G [n] \ {i,j} for which Vi,Vj,Vk are on a line. Then all the points vi, . . . ,v n 
are on a single line. 



This theorem takes local information about dependencies between points and concludes 
global information about the entire configuration. For more on the history and generalizations 
of this theorem we refer to the survey [BM90j . A complex variant of this theorem was proved 
by Kelly: 

Theorem 1.2 ( |Kel86j ). Suppose v\, . . . ,v n G C d are such that for all i ^ j G [n] there is 
some k G [n] \ {i,j} for which Vi, Vj,Vk are on a (complex) line. Then all the points v\, . . . , v n 
lie on a single (complex) plane. 
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The global dimension bound given by Kelly's theorem is tight since, over the complex 
numbers, there are two-dimensional configurations of points satisfying the condition on triples. 

In a recent work, Barak et. al. [B DWYlT] proved quantitative (or fractional) analogs of 
Kelly's theorem in which the condition 'for all i 7^ j G [n]' is relaxed and we have information 
only on a large subset of the pairs of points for which there exists a third collinear point0. 

Theorem 1.3 ( [BDWYTTj ). Suppose v±,... ,v n G C d are such that for all i G [n] there exist 
at least 5(n — 1) values of j G [n] \ {i} for which there is k G [n] \ {i,j} such that Vi,Vj,Vk are 
on a line. Then all the points vi,...,v n lie in an affine subspace of dimension 13/5 2 . 

A more recent work [D SW12] improves the dimension upper bound obtained in the above 
theorem from 0(1/ S 2 ) to the asymptotically tight 0(1/ 5) and also gives a new proof of Kelly's 
theorem (when 5 = 1 one gets an upper bound of 2 on the dimension). 

In this work we consider configurations of points in which there are many triples that 
are 'almost' collinear, in the sense that there is a line close to all three points (in the usual 
Euclidean metric on C d ). Equivalently, the points are contained in a narrow tube. Our goal 
is to prove stable analogs of the above theorems, where stable means that the conclusion 
of the theorem will not change significantly when perturbing the point set slightly. Clearly, 
in such settings one can only hope to prove that there is a low dimensional subspace that 
approximates the set of points. There are many technical issues to discuss when defining 
approximate collinearity and there are some non trivial examples showing that word-to-word 
generalizations of the above theorems do not hold in the approximate-collinearity setting (at 
least for some of the possible definitions). Nonetheless, we are able to prove several theorems 
of this flavor for configurations of points satisfying certain 'niceness' conditions. We also study 
stable variants of error correcting codes (over the reals) which are locally correctable, in which 
such approximately collinear tuples of points naturally arise from the correcting procedure. 

In [BDWYlT] . a connection was made between the Sylvester-Gallai theorem to a special 
kind of error correcting codes called Locally Correctable Codes (LCCs). In these codes, a 
receiver of a corrupted codeword can recover a single symbol of the codeword correctly, making 
only a small number of queries to the corrupted word. When studying linear LCCs over the 
real or complex numbers one encounters the same type of difficulties in trying to convert local 
dependencies into global dimension bounds. Building on this connection, and our ability to 
analyze 'approximate' linear dependencies, we define the notion of stable LCC and show that 
these do not exist for constant query complexity. Stable LCCs correspond to configurations of 
points with many approximately dependent small subsets and so our techniques can be used 
to analyze them. 

We note here that understanding the possible intersection structure of tubes in high di- 
mensional real space comes up in connection to other geometric problems, most notably the 
Euclidean Kakeya problem [TaoOlj (we do not, however, see a direct connection between our 
results and this difficult problem). 

Our proof techniques extend those of [BDWY11, DS W12] and rely on high rank properties 
of sparse matrices whose support is a 'design'. In this work we go a step further and, instead of 

The sets of points satisfying the conditions of the theorem were called S-SG configurations in BD WYlT] . 
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relying on rank alone, we need to bound the number of small singular values of such matrices. 

Organization: In Section [2] we formally state our results for point configurations. The 
results are stated in several sub-sections, corresponding to different variants of the problem we 
consider. In Section [3] we define stable LCCs and state our results in this scenario. The proofs 
are given in Sections H] - El 

Notations: We use big 'O' notation to suppress absolute constants only. For two complex 
vectors u, v G C d we denote their inner product by (u, v) = Ylf=i u i ' W an d use ||u|| = \J (v, v) 
to denote the £2 norm. For an m x n matrix A, we denote by ||^4|| the norm of A as a 
vector of length ran (i.e., the Forbenius norm). The distance between two points u, v G C d 
is defined to be \\u — v\\ and is denoted dist(«, v). For a set S C C d and a point v G C d we 
denote dist(v,S') = inf ug s dist(u, v). We let S d C C d+1 denote the d-dimensional unit sphere 
in complex d + 1 dimensional space. By fixing a basis we can identify each v G S d with ad+l 
length complex vector of ^-norm equal to one. 

2 Point configurations 

In this section we state our results concerning point configurations. The first section, Sec- 
tion 12.11 deals with the most natural setting - the affine setting - in which we consider sets 
of points in C d with many almost-collinear triples. In Section 12.21 we consider the projective 
setting where the points are located on the sphere and collinearity is replaced with linear 
dependence. Section 12.31 states a more general theorem from which both the affine and the 
projective results follow. 

2.1 The affine setting 

We begin with the definition of an e-line. 

Definition 2.1 (line,line e ). Let u / d£ C d . We define line(u,v) = {au+(l — a)v \ a G C} to be 

the complex line passing through u,v. We define line e (u,v) = {w G C d \ dist(w, line(u,v)) < e}. 

The following definition will be used to replace the notion of dimension with a more stable 
definition. 

Definition 2.2 (dim e ). For a set of points V C C d and e > we denote by dim e (V) to be the 
minimal k such that there exists a k-dimensional subspac^ L C C d such that dist(v, L) < e for 
all v G V. 

To give an idea of the subtleties that arise when dealing with approximate collinearity, 
take an orthonormal basis ei, . . . , ej, in C d and consider the set V = {ei, e[, . . . , e^, e' d } with 

2 The difference of 1 between affine and iinear dimension wili not be significant in this paper and so we use 
a iinear subspace in the definition. 



3 



e\ = (1 + e)ej. Clearly, there is no low dimensional subspace that approximates this set of 
points, even though there are many pairs for which there is a third e-collinear point {e[ is 
e-close to the line passing through ej and any other third point). An obvious solution to 
this problem is to require that the minimal distance between each pair of points is bounded 
from below (say by 1), so that the condition of e-collinearity is meaningful. We now describe 
another, less trivial, example which shows that this condition alone is not sufficient in general. 

Example 2.3. Let e\, . . . ,e& be an orthonormal basis in C . Let V{ = Bei, U{ = (B — l)ej 
for all i G [d] and let V = {ei,Ui,Vi \ i G [d]} be a set of n = 3d points. Then for all i,j G [d] 
we have Ui G line t (vi,ej) and Vi G line e (ui,ej) with e = 1/B. Thus, there are many e-collinear 
triples in V (as in the conditions of Theorem^ with 5 = 1/3). However, for any subspace L 
of dimension o{n), the distance of at least one of the point Uj to L must be at least Q(B) (this 
can be shown, e.g., using Lemma \4.3\ ). 

In this example, we had e = 1/B, where B is roughly equal to the ratio between the smallest 
and the largest distance, or the 'aspect ratio' of V. We will prevent this scenario by requiring 
that e will be sufficiently smaller than 1/B, where B will be the aspect ratio. This motivates 
the following definition. 

Definition 2.4 (i?-balanced). A set V C C d is said to be B-balanced if 1 < dist(v,v') < B for 
all v ^ v' G V. 

The following theorem gives the most easy to state version of our results. 

Theorem 1. Let n,d > be integers and let B,e > be real numbers with e < 1/1QB. Let 

V = {vi, . . . ,v n } C C d be B-balanced and suppose that for every i ^ j G [n] there exists 
ke[n]\ {i,j} such that v k E line e (vi,Vj). Then, dim e /(V) < 0(B 6 ) with e' < 0(eB 2 - 5 ). 

Observe that a corollary of this theorem is that the number of points, n, is bounded from 
above by a function of B. A priori, we did not have this bound since a l?-balanced configuration 
in C d can have an unbounded number of points when d grows. 

Notice that our definition of e-collinearity is not symmetric in that it depends on the order 
of the triple. As is shown in Lemma 15.21 this is not an issue for .B-balanced configurations, 
as long as we are willing to replace e with eB. For general (i.e., non balanced) configurations 
the situation can be more complicated and it is possible that using a stronger collinearity 
condition (e.g., requiring that any permutation of the triple satisfies our condition) is sufficient 
for obtaining a global dimension bound. 

Theorem [T] will be a special case of the following, more general theorem, in which we 
only have information of a subset of the pairs (i,j). Assuming V has many e-collinear triples 
(for each point), we derive an upper bound on dim e /(l/) for e' which depends on the other 
parameters. We also derive a better bound on e' when restricting to a subset of the points. 

Theorem 2. Let n, d > be integers. Let B,5, e > be real numbers with e < 1/16-B. Let 

V = {v\, . . . ,v n } C C d be B-balanced and suppose that for every i G [n] there are at least 
5{n — 1) values of j G [n] \ {i} for which there exists k G [n] \ {i,j} such that G line e (vi,Vj). 
Then 
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1. dinv(y) < 0(B 6 /5 2 ) with e' < 0(eB 2 - 5 /5 05 ). 

2. There exists a subset V C V of size fi(rt) with dinv(y') < 0(B 6 /5 2 ) and e" < 0{Be). 

In both of the above theorems, the parameter B appears in the resulting global dimension 
bound. We suspect that this dependence can be removed so that the bound on the dimension 
will be O(l) in Theorem Q] and 0(l/5 2 ) (or even 0(1/ 6)) in Theorem [2J The blowup in e', 
compared to e is also likely to be suboptimal. 

A stronger definition of collinearity, for which Example 12.31 fails, is to require that each 
point in the triple is e-close to the line spanned by the other two points. Let us call such 
triples strongly e-collinear triples. It is easy to see that, in Example 12.31 the triples do not 
satisfy this stronger definition. Thus, it is possible that one could prove analogs of Theo- 
rem [2] for configurations that are not £>-balanced using this stronger definition of approximate 
collinearity. 

We conclude this discussion with yet another example showing that, even for the case 5 = 1 
(i.e, the original Sylvester-Gallai condition) the weak definition of e-collinearity requires some 
balancedness condition (though potentially weaker). 

Example 2.5. Fix some large B > 0. Take an orthonormal basis e\, . . . , G C d and define 
V = {0} U Uie[d] {-S* -le i> (B 1 ^ 1 + l)ej}. One can verify by induction that for every u,v £ V 
there is a third point inside line t (u,v) with e ~ 1/B. There is also no low dimensional subspace 
that approximates V (similar to the previous examples). 

2.2 The projective setting 

Since the definition of e-collinearity (that is, vt G line e (^, Vj)) is sensitive to scaling, a projective 
statement of Theorem[2j in which these scaling issues do not arise, seems natural. In this setting 
we consider points on a sphere and lines are replaced by circles (two dimensional subspaces 
intersected with S d ). 

Definition 2.6 (circ,circ e ). Let u, v G S d . We define circ(u,v) = span{u,v} n S d . We define 
circ e (u, v) = {w E S d \ dist(w, circ(u, v)) < e}. 

An instructive example in the projective case is the following: 

Example 2.7. Take V to be a maximal set in S d with pairwise distances at least \x > (so 
that n (c//i) d with c a constant). Since every point in S d is of distance at most \i from one 
of the points in V (otherwise we could add it) we get that each set circ^(vi,Vj) contains at least 
0,(1/ jx) > 2 points from V . On the other hand, for any low dimensional subspace L (say, with 
dimension d' independent of n) almost all points in V will have distance at least 1/100 from 
L. 

From this example we see that there needs to be some upper bound on e as a function of 
the minimal distance in the set. We will use the following definition to replace i3-balancedness. 

Definition 2.8 (/i-separated) . A set V C S is said to be [i-separated if for every u ^ v G V 
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we have mm{dist(u,v), dist(u, —v)} > [a. 

We now state our theorem for points on a sphere. 

Theorem 3. Let n,d > be integers and let 5, /i,e > be real numbers with e < ^ 2 /32. Let 
V = {v\, . . . ,v n } C S be fi-separated and suppose that for every i G [n] there are at least 
5(n — 1) values of j G [n\ \ {i} for which there exists k G [n] \ {i,j} such that G circ € (vi,Vj). 
Then 

1. dinv(T/) < 0(1/5V) with e' < 0{e/5°- 5 ^ 2 ' 5 ). 

2. There exists a subset V C V of size fi(n) with dim e »(V') < 0(l/5 2 fi 6 ) and e" < 0(e/fj). 

Notice that, when compared with Theorem [31 the parameters [i corresponds to 1/B. How- 
ever, the condition on e < /i 2 /32 is more restrictive in this case. We do not know whether this 
condition can be improved to e < O(fi). As is the case with Theorem [3j we do not expect the 
dependency in the dimension bound and in e' to be tight. 



2.3 The general statement 

Both Theorem [2] and Theorem [3] will follow from a more general statement requiring a set of 
points with a family of e-dependent triples satisfying certain conditions. 

Definition 2.9 ((e, /^-dependent). We say that a triple of points u,v,w G C d is (e, in- 
dependent if there exists complex numbers a,/3,7 with \a\, |/3|, I7I G 1] such that 

\\au + j3v + ^w\\ < e. 

Definition 2.10 ((p, ^-design). Let T C (3) be a family of triples in [n]. We say that T is 
a (p,g)-design if 

1. For all i G [n] there are at least p triples in T that contain i. 

2. For all i 7^ j G [n] there are at most g triples in T containing both i and j. 

The following theorem gives a low dimensional subspace that approximates all points in a 
configuration in which there is a design of triples that are (e, ^-dependent. Below we will also 
prove a slightly more refined statement (see Theorem I4.ip giving better distance from L for 
many points in the configuration. 

Theorem 4. Let n,d > be integers andp, g, 5, fi, e > be real numbers. Let V = {i>i, . . . , v n } C 
C d , T C ('■3') be such that T is (p, g)-design, and for every {i,j,k} G T the triple V{,Vj,Vk is 
(e, /j,)- dependent. Then, 

In 2 a 2 

dinv(V) < 

with 

e < - 



pfi 2 
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A setting of the parameters which will be most relevant to us is when \T\ is quadratic in 
n, p is linear in n and g and fx are constants. In this case we get a constant upper bound on 
the dimension dim € /(V) with e' = 0(e). 

The proof of Theorem |4] is given in the next section with the proofs of Theorems [2] and [3] 
in Sections [5] and [6] respectively. We give a high level overview of the proof below. 

Proof overview: We place the points v\,. . . ,v n as rows in a matrix A. We then use the 
triple family T to construct a matrix M such that 

• M is a \T\ x n matrix whose support is determined by T. More precisely, the non zero 
coordinates of the i'th row of M, with t E T, will be the three elements in t. 

• The values of the entries of M will be in absolute value between \x and 1. 

• The product M ■ A will have small Forbenius norm. 

We then observe that the matrix X = M*M is diagonal dominant (its diagonal elements 
are much larger than its off-diagonal elements). This implies, using the Hoffman- Wielandt 
inequality, that M has only a few small singular values. From this we get that the columns of 
A must have small distance (on average) to the span of the small singular vectors of M and so 
can be approximated well by a low dimensional space. We then show that the same statement 
holds when one replaces the columns of A with the rows of A (a fact which generalizes the 
simple fact that the row rank is equal to the column rank). Using the bound on the average 
distance of rows we argue that there is a large subset that is approximated well by a low 
dimensional subspace. We then extend this to all points using interpolation. 

3 Stable Locally Correctable Codes 

Before discussing local correction, we briefly mention the exciting recent developments re- 
garding 'standard' (non-local) error correcting codes over the reals. Like in the analogous 
theory over finite fields, one would like to encode (typically via a linear transformation) 
a vector of entries from a given field F by a longer one, such that the original message 
can be decoded even when some entries of the codeword are corrupted. The breakthrough 
of 'compressed sensing' by Donoho and Candes-Tao, and subsequent developments (see e.g. 
[CT05I |RV05| IDon061 IKT071 IDMT07I IGLW09] ) has lead to an understanding of codes over 
the reals that is almost as good as in the finite-field case. In particular, there are real-valued 
codes which achieve the gold-standard of coding theory of constant rate linear codes with effi- 
cient encoding and decoding algorithms from a linear number of errors of arbitrary magnitude. 
Moreover, these codes have stable versions which can recover a vector close to the original 
message even if small errors affect all coordinates of the encoding. Our local variant may be 
viewed as one local analog of such stable codes. 

Informally, Locally Correctable Codes (LCCs) are error correcting codes that allow the 
transmission of information over a noisy channel so that the symbols of the transmitted words 
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have many local dependencies between them. The most general definition requires that one can 
reconstruct (w.h.p) any coordinate in a possibly corrupted codeword, using a small number of 
(randomly chosen) queries to the other coordinates. The noise model is adversarial, meaning 
that the corrupted positions are arbitrary (and not random) and one only has a bound on the 
total number of errors (which is usually assumed to be a small constant fraction). LCCs are 
closely related to another type of codes - Locally Decodable Codes (LDCs)- whose study was 
initiated in a work of Katz and Trevisan [KTOO) . We refer the interested reader to [Yekll] for 
the relevant background on LDCs and LCCs and their applications in computer science. 

The connection between LCCs and the Sylvester-Gallai theorem was first observed in 
[BD WYlT] . When studying the special case of linear LCCs (i.e., LCCs that are given by 
linear mappings over a field) one can easily show that LCCs are equivalent to point configura- 
tions with many linearly dependent small subsets. The general definition of linear LCCs is as 
follows (we fix the field to be C but the same definition works for any field). We use w(v) to 
denote the number of non zero elements in a vector v G C n . 

Definition 3.1 (Linear LCC - first definition). A (q,5)-LCC over C is a linear subspace 
U C C m such that there exists a randomized decoding procedure D : C m x [m] \-t C with the 
following properties: 

1. For all x G U, for all i G [m] and for all v G C m with w{v) < 5m we have that 
D (x + v,i) = Xi with probability at least 3/4 (the probability is taken only over the 
internal randomness of D ). 

2. For every y G C m and i G [m], the decoder D(y,i) reads at most q positions in y. 

The dimension of an LCC is simply its dimension as a subspace ofC m . 

It is shown in [BDWYlTj that, w.l.o.g. the decoding procedure is linear, in the sense that 
it first picks a set of at most q coordinates to read and then outputs a linear combination of 
them (with coefficients in C). This linearity of the decoder implies that, for each coordinate 
in the code, there are many small subsets of the other coordinates that span it. Since each 
coordinate corresponds to a row of the generating matrix of the code, we obtain a configuration 
of points with many dependent small subsets. We will make this formal in the next definition, 
which is equivalent to the first definition, if one replaces S with the slightly worse bound of S/q 
(when q is constant this change is negligible). 

Definition 3.2 (Linear LCC - second definition). We say that a finite set V = {v±, . . . , v n } C 
C d is a (q,S) -LCC if for every i G [n] and every set S C [n] of size \S\ < 5n there exists a set 
J C [n] \ S with | J\ < q such that V{ G span{vj \j G J). 

The main open problem regarding LCCs is to determine the maximum dimension (as a 
function of n) when we fix q, 5 to be constants. Intuitively, the larger d is, the more 'information' 
we can transmit using the code (the rate of the code if d/n). While the case of q = 2 is 
understood quite well (d is at most logarithmic over finite fields and constant over characteristic 
zero [BDWY11, BDSSllJ), it is an open problem to determine the maximum dimension of a q- 
query LCC when q > 2. There are exponential gaps between the known lower and upper bound. 
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For example, when q = 3, the best upper bound is d < 0{yfn) |Woo07t IKdW04] while the 
best constructions give poly-logarithmic d over finite fields and constant d over characteristic 
zero. We refer the reader to the survey article [Dvil2j for more background on LCCs and for 
an overview of the known constructions. 

Due to their roots in coding theory, LCCs were traditionally studied exclusively over finite 
fields. The study of LCCs over arbitrary fields was initiated in [BDWYll] and was motivated 
by its connection to the Sylvester-Gallai theorem. Another motivation comes from a work 
connecting LCCs with an approach for constructing rigid matrices over infinite fields |DvilO| . 
We note here that for q > 2, the best upper bounds on the dimensions of LCCs are the same, 
no matter what the field is. This also motivates the study of LCCs over infinite fields as a 
potentially easier scenario to tackle first, before proceeding to codes over finite fields (where 
we have fewer techniques). 

Our methods enable us to prove strong upper bounds on the dimension of codes that we 
call stable LCCs. Before discussing the relation between stable and non-stable LCCs we give 
the formal definition. 

Definition 3.3 (span B ). Let v,u±, . . . , u m G C d . We say that v G span B (u\, . . . ,u m ) if there 
exist oi, . . . , a m G C with \a,i\ < B for all i and v = a i u i- 

Definition 3.4 (Stable LCC). We say that a finite set V = {v\, . . . , v n } C C d is a (q, 5, B, e)- 
stable LCC if for every i G [n] and every set S C [n] of size \S\ < 5n there exists a set 
J C [n] \ S with | J| < q such that dist(vi, span B (vj \j G J)) < e. 

Notice that this definition is incomparable to Definition 13.21 On the one hand, we restrict 
the linear dependencies to use only coefficients of bounded magnitude. On the other hand, we 
allow the linear combinations to result in an 'approximate' vector, instead of the exact one. 
To see why the bound on the coefficients is natural (once you allow approximate recovery), 
notice that the decoder can handle small perturbations even in the 'correct positions '. Stated 
in the scenario of Definition 13. 1\ suppose that in a received codeword at most 5 fraction of the 
positions are completely changed (to arbitrary values) and, in addition, all other coordinates 
are perturbed by some small a in Euclidean distance. Then, the decoder can still recover 
(approximately) the value of a given codeword coordinate by reading at most q other positions, 
as long as a < c/qB. Since each of the read coordinates is multiplied by a coefficient that can 
be as large as B and the errors sum over q positions, we get at most a ■ qB resulting error in 
the output of the decoderU 

The next simple claim shows that Definition 13.41 is also stable in the sense that, perturbing 
the elements in a stable LCC gives another stable LCC (with slightly worse parameters). 

Claim 3.5. Let V = {vi, . . . ,v n } cC d be a (q, 5, B, e) -stable LCC and let V = {v[, . .. ,v' n } C 
C d be such that c//st(t>j,^) < a for all i G [n]. Then V is a (q, 6, B, e') -stable LCC with 
e' < e + {qB + l)a. 

Proof. Take some G V and a set J C [n] of size \ J\ < q such that dist(vj, span B (fj | j G J)) < 

3 One can potentially define stable LCCs in this sense (as in Definition 13. 1[) and then prove (similarly to 
BDWYll ) that, up to constants, it is equivalent to Definition 13.41 fwe did not verify the details). 
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e. Then, there exist coefficients bj,j G J with < B and such that 



j6J 



< e. 



Replacing «j with v[ we get that 



< e + \\vi - vlW + } ,bj\\vj - v'j\\ < e + (qB + l)<x 



□ 



Notice that, if we didn't have the bound on the coefficients in the span, the small pertur- 
bations would have resulted in large errors in the linear combinations. Intuitively, if u is not 
in span B (ui, . . . , u m ) then a small perturbation to the u^s may result in u being very far from 
span(ui, . . . ,u m ). This explains the need for two separate stability parameters, e and B. 

Our main result regarding stable LCC's is the following theorem: 
Theorem 5. Let V = {vi, . . . , v n } C C d be a (q, 5, B, e)-stable LCC. Then, 

dim e /(y) < 0{(qB/5) 4 ) 

with 

e' = 0{q 2 Be/5 1 ^). 

In particular, when q is a constant and B and 5 are fixed, the upper bound on dim e / can be 
interpreted as saying that there do not exist stable g-query LCCs, where 'do not exist' means 
that the amount of information one can transmit is constant, regardless of the codeword length. 
The proof of Theorem [5l which follows the same lines as the proof of the Sylvester-Gallai type 
theorems, works also for the more general setting where V is allowed to be an ordered multiset 
(i.e., when different v^s can repeat several times). 

If one sets e = the definition of stable LCC changes into a definition of an LCC with 
bounded coefficients. That is, the linear dependencies are required to be exact (as in the usual 
definition of an LCC) and, in addition, need to use bounded coefficients. Applying Theorem [5] 
to this special case one gets e' = and so obtains the stronger conclusion that the set V is 
actually contained in a low dimensional space. Stated more formally, we have: 

Corollary 3.6. Let V = {v t , . . . , v n } C C d be a (q, 6, B, 0) -stable LCC. Then, 

dim(V) < 0((qB/5) 4 ) 
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4 Proof of Theorem 3] 



We will derive Theorem E] from the following, more refined, statement. 

Theorem 4.1. Under the same conditions as in Theorem^ there exists a subspace L C C d 




and such that 



n 

Y,dist( Vl ,L) 2 < 

i=i 




Proof. First, observe that, for convenience, we can take d = n so that the vectors Vi are in 
C n . The case d > n is not interesting since we can restrict our attention to the span of the n 
vectors. The case d < n can be similarly handled by padding each vector with zeros. 

Let m = \T\. We use T to construct an m x n matrix M so that there is a one-to-one 
correspondence between rows of M and elements of T. By our assumptions, for each triple 
t = {h 3,k} E T there are complex numbers a,/3,7 such that \\avi + /3vj + "fVk\\ < e and s.t 
A* < M) I7I < 1- Let denote the row vector in C n with the value a in position z, the 
value f3 in position j, the value 7 in position k and zeros everywhere else. We define M to be 
the matrix with rows st where t goes over all triples in T (in some order). 

Next, let A be a complex n x n matrix whose i'th row is the vector t>j. Then, from our 
definition of the rows of M, we have that the rows of the m x n matrix 



all have norm at most e. 

The next claim summarizes some of the properties of M that we will use. All three items 
follow immediately from the fact that T is a (p, g)-design and the bounds on the entries of M. 

Claim 4.2. Let M be as above and let Mj E C m , j G [n] denote the j'th column of M . Then 

1. Each entry of M has absolute value at least [x and at most 1. 

2. For each j G [n], \\Mj\\ 2 > p/j, 2 . 

3. For each j / / G [n], \(Mj,Mj>)\ < g. 

The main technical ingredient in the proof is the following simple observation regarding 
the eigenvalues of diagonal dominant matrices, i.e., matrices in which the diagonal elements 
are much larger than the off-diagonal elements. This lemma can be viewed as an extension of 
a folklore result regarding the rank of such matrices (see, e.g., [Alo09] ). The proof is a simple 
application of the Hoffman- Wielandt inequality. 



E = MA 



(1) 
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Lemma 4.3. Let X = {Xij)ij£\ n \ be an n x n complex Hermitian matrix with eigenvalues 
Ai,...,A n . Suppose that for all i G [n] we have Xu > K, where K is some positive real 
number. Then, 

\{ie[n) I A,<^/4}|< A^|^f. 

Proof. Let Dbeannxn diagonal matrix with Da = Xu for all i 6 [n] . Clearly, the eigenvalues 
of D are Dn, . . . , D nn . The Hoffman- Wielandt inequality [HW53] states that, under some 
ordering of the eigenvalues of X (w.l.o.g the one we have chosen) we have 



\^-Du\ 2 < \\X -D\\ 2 = Y,\X>, 

86 [n] i^j 



2 

i]\ ■ 



Using the fact that all D^s are at least K we get the required bound. □ 

Let o"i, . . . ,cr n be the singular values of the matrix M (recall that these are the square 
roots of the eigenvalues of the PSD matrix M*M). Let r±,. . . ,r n be the corresponding right 
singular vectors (i.e., the corresponding eigenvectors of M*M). We thus have 

1. n, . . . , r n form an orthonormal basis of <C n . 

2. For each j G [n], \\Mrj\\ = aj. 

3. The vectors Mn, . . . , Mr n are orthogonal (i.e., (Mr,, Mrj) = for i ^ j). 
Let 

J = {j G [n] | o-j < n^/p/2} 

and let 

L = spanjrj | j G J}. 

We will now show that L is of small dimension and that most columns of A are close to L. We 
start by bounding the dimension of L. 

Claim 4.4. Let L be as above. Then \J\ = dim(L) < 2 " 9 t . 



Proof. Consider the n x n matrix X = M*M with eigenvalues o~\ , . . . , <r 2 . By Claim 14.21 the 
diagonal elements of X are all lower-bounded by p/i 2 and the off-diagonal elements of X are 
all upper bounded by g in absolute value. Using Lemma 14.31 an d these bounds on the entries 
of X, we get that 

2n 2 g 2 



\{i€[n] | of <PM 2 /4}| < 



Taking square roots completes the proof. □ 
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Let u\, . . . ,u n denote the columns of A. We can write each Uj in the orthonormal basis 
r\ , . . . , r n in a unique way as 



U.; 



^a jk r k . 



k=l 



Observe that 



dist(itj , L) 2 = E \a jk \ 



(2) 



Denote the rows of the matrix E = MA by Bi,i £ \m\ so that ||ej|| < e for all i E [m]. Let 
/i, . . . , f n be the columns of E and observe that 



£ll/,ll 2 =£lMl 2 < 

je[n] ie[m] 



me 



(3) 



The next claim bounds the sum of distances of the vectors Uj to the subspace L. 
Claim 4.5. With the above notations, we have 



Y,dist( Uj ,L) 2 < 



4me 
fi 2 p 



Proof. Using (EJ) , (J3j) , the orthogonality of the Mrj 's and the fact that Uj > for all j J, 
we have 

me 2 > Ell/iH 2 =Ell M %H 2 
je[n] je[n] 

2 

= E E a ikMr k 
ie[n] fce[n] 

= E E \ a o^°l 

je[n] ke[n] 



> 



MP 



E dist(uj,L) 2 . 



This proves the claim. 



□ 



We now use Claim 14.51 to deduce that many rows of A are close to a low dimensional 
subspace. 
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Claim 4.6. There exists a subspace V C C n with dim(L / ) < 2 ™i^a and s.t 

^c//st(^,L') 2 <-^. 

i=i 

Proof. Let Y be an n x n matrix such that the j'th column of Y is the element of L closest to 
Uj. If we let L' be the span of the rows of Y we have dim(L') < dim(L) and, using Claim [4~5j 

y dist(« 7 -,i/) 2 < iiy-^h 2 = y dist( u „L) 2 < 

je[n] j€[n] 

□ 

This claim completes the proof of Theorem 14.11 □ 
Proof of Theorem [4] using Theorem 14.11 

From Theorem 14.11 we can get a large subset of V that is e'-close to a low dimensional subspace 
L. To derive the conclusion of Theorem^ we will show that the rest of the points in V are also 
close to L, though with a slightly worse bound on the distance. This will follow by showing 
that, for every point v £ V, there are two points u, w G V that are close to L and s.t v is close 
to the line passing through them. This will imply that v is also close to L. The details follow. 

First, apply Theorem 14.11 to get a subspace L so that 

2nV 
dim(L) < — j-j- 



and such that 



Let 



4me 2 



V dist(ui,L) 2 < ., 



I = <i 6 W 



... rN2 4gme 2 

and observe that |/| < p/g. Our final step is to argue that the points Vi,i £ I are also close to 
L' since they are close to the span of two points Vj, with j,k ^ I (using the design properties 
ofT). 

Claim 4.7. For each i € I there are indices j, k G [n] \ I such that {i,j, k} G T. 

Proof. Fix some i £ I. If the claim is false then every triple in T that contains i must have 
some other element in I. By a pigeon hole argument, there must be an element j G I\ {i} and 
at least p/\I\ > g triples containing both i and j, contradicting the design property of T. □ 

We will need the following simple lemma: 
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Lemma 4.8. Let u,v,w G be an (e, p)- dependent triple. Let L C C d be a subspace with 
dist(v, L), dist(u, L) < p for some p > 0. Then dist(w, L) < (e + 2p)/p. 

Proof. Let a,/3,7 be such that \a\, |/3|, I7I G [//, 1] and \\au + j3v + 7io|| < e. Let t/, «' £ L be 
s.t \\v — ||u — < p. Then 

dist(w,L) < \\w + (a/-f)v + (P/-f)u'\\ 

< \\w + (a/7> + (/3/7)m|| + ||(a/7> - («/7)w'l| + II (^/t)*" - (PhWW 

< e/| 7 | + |a/ 7 |p + \(3/ 7 \p 

< (e + 2p)//i. 

□ 



Combining Claim I4T71 with Lemma 14.81 we have that each Vi,i G [n] is e' close to L with 

5, we gei 

5e^/gm 



e' < (e + 2p)/p, where /? = — Simplifying, we get 



e' < 



as was required. This completes the proof of Theorem SJ □ 

5 Proof of Theorem [2] 

We start with some preliminary lemmas. 

Lemma 5.1. Let {u, v,w} G be B-balanced. If w G line e (u,v) with e < 1/2 then the triple 
u,v,w is (e, 1/41?)- dependent. Furthermore, there exists a complex a with \a\ > 1/42? such 
that \\w — au — (1 — a)v \\ < e. 

Proof. By shifting ti; to zero we can assume that both u and v have norm bounded by B. By 
definition, there exists a G C such that ||u> — an — (1 — a)v\\ < e and so we only need to show 
that |a| > 1/41? (the same argument will apply to 1 — a by symmetry). Observe that 

1 < \\w — v\\ 

< \\w — au — (1 — a)v || + ||an|| + ||aa>|| 

< e + 2aB, 

which proves the lemma. □ 

Lemma 5.2. Let {u, v,w} G be B-balanced and let < e < 1/2 be a real number such that 
w G line e (u,v). Then v G line t i(w,u) with e' = 4eB. 

Proof. By Lemma 15 . 1 1 there exists a complex a with \a\ > 1/41? such that 

\\w — av — (1 — q)u|| < e. 



15 



Then 

\\v - (l/a)w + (l/a - l)v\\ < e/a < AeB. 
This completes the proof. □ 

Lemma 5.3. Let u, v G C d be two distinct points. Let k be the maximum size of a B-balanced 
set contained in line e (u,v). If e < 1/4 then k < 5B. 

Proof. Suppose k > 5B and let V = {v%, . . . ,Vk} be a £>-balanced set contained in \\ne e (u,v). 
For each V{ let Ui G line(-u,-u) be a point of distance at most e from it. Since the k points 
ui, ■ ■ ■ , itfc are all on a line segment of length at most 2B we can apply a pigeon hole argument 
to conclude that there must be i / j with dist(nj, Uj) < 2B/(k — 1). This implies dist(uj, v j) < 
2e + 2B/(k - 1) < 1, a contradiction. □ 



Proof of Theorem [2] 

We define T C to be the set of triples {i, j, k} C [n] (with three distinct indices) for which 
Vk G \\ne e (vi,Vj). By Lemma 15. II we have that for each triple {i,j, k} in T, the corresponding 
triple Vi,Vj,Vk G C rf is (e, l/4B)-dependent. 

Claim 5.4. T as defined above is a (p,g) design with p = 5{n — 1) and g < 5B. 

Proof. By the conditions of the theorem, each V{ is contained in at least 5(n — 1) triples that 
are in T and so the bound on p holds. To prove the bound on g, fix % ^ j G [n]. If the triple 
{i,j, k} appears in T. Then either vt G line e (vj, Vj), Vi G \\r\e t (vj,Vk) or Vj G line e (i>j, i^). In all 
three cases, we have, using Lemma [531 that G \'me e >(vi,Vj) with e' = 4eB. Since e < 1/165 
we have e' < 1/4 and we can apply Lemma 15.31 to conclude that there could be at most 5B 
such triples. □ 

Observe that we can discard some of the triples in T so that \T\ < 5n 2 and so that T is 
still a (p, g)-design (simply keep for each i only 5(n — 1) dependent triples). 

Plugging the bounds obtained in the above claims and the bound |T| < 5n 2 into Theorem H] 
we get a subspace L with dim(L) < 0(B G /5 2 ) and such that dist(uj,L) < 0(eB 2 - 5 /VS) for all 
j 6 [ti]. The second part of the theorem follows from applying Theorem 14.11 



6 Proof of Theorem [3] 



We first prove some preliminary lemmas. 



Lemma 6.1. Suppose u,v G S d are s.t mm{dist(u,v), dist(u, —v)} = \i. Then, for all complex 
P, dist(u,/3v) > fi/4. 

Proof. Suppose w.l.o.g dist(«, v) = \i < \pl. We have 



\i = y/{u-v,u-v) = \Jl - 2{u,v), 
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which gives (u, v ) = 1 — /U 2 /2. Since dist(u, jv) is minimized for 7 = (u, v) we have dist(it, j3v) > 
dist(u, (1 - n 2 /2)v) = \\u-v + {n 2 /2)v\\ > \\u - v\\ - \\(fi 2 /2)v\\ > fi - fi 2 /2 > fi/4 (for 
H < V2). □ 

Lemma 6.2. Let u,v,w G S d be distinct and let e,[i > be real numbers s.t e < fi/8. Suppose 
\\w — au — /3v\\ < e for some complex numbers a,f3. If mm{dist(w, v), dist(w, — v)} > fi then 
\a\ > fi/8. 

Proof. By the triangle inequality 

\\w — /3v\\ < \\au\\ + e = \a\ + e. 
Using Lemma IBTTl we have d\st(w,(3v) > fi/4 which gives \a\ > — e > fi/8. □ 

Lemma 6.3. Let u,v,w G S d be fi-separated and suppose e < fi/8. Suppose w G circ e (u,v). 
Then, there exist complex numbers a, /3, 7 with \\au+f3v+jw\\ < e and s.t fj>/8 < \a\,\f3\,\j\ < 1. 

Proof. By the assumption, there are c/,/3' with \\w — a'u — f3'v\\ < e. If \a'\ and |/3'| are at 
most 1 then we are done using Lemma 16, 21 If not, suppose \a'\ = max{|c/|, |/3'|} > 1 and 
divide the equation by a' to obtain ||(l/a')u; — u — (j3' /a')v\\ < e/|a'| < e. Now, all three 
coefficients are at most 1 in absolute value and, using Lemma l6.2( we have the lower bound 
fi/8 on |l/a'|,|/3'/c»!'|. □ 

Lemma 6.4. Let u,v,w G S d be distinct. Let e, fi > be real numbers such that e < n/8. 
Suppose w G circ t (u,v) and mm{dist(w, v), dist(w, — v)} > fi. Then u G circ t /(w,v) with e' = 
8e//i. 

Proof. By our assumption, there exist complex numbers a, /? such that 

||«; — au — j3v\\ < e. 
By Lemma [6721 we have \a\ > /j>/8 and so 

\\u - (l/a)w + (P/a)v\\ < 8e/fi. 
This implies u G circ e /(w, v) as was required. □ 

Lemma 6.5. Let u,v G S d be two distinct points. Let k be the maximum size of a js-separated 
set contained in circ e (u,v). If e < then k < 8//x. 

Proof. Suppose k > 8/ fi and let V = {vi, . . . , v^} be a ^-separated set contained in circ e (u, v). 
For each Vi let Ui G circ(n, v) be a point of distance at most e from it. By a pigeon hole 
argument, there must be i ^ j with min{dist(-Uj, Uj), dist(uj, — Uj)} < ir/k < /i/2. This implies 
min{dist(vj, Vj), dist(t>j, — Uj)} < 2e + /i/2 < /i, a contradiction. □ 

Proof of Theorem^ To reduce to Theorem [5] we will define T C ('3') to be the set of triples 
{i, j, k] C [n] for which Vk G circ^i, Vj). 
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Claim 6.6. Let {i,j, k} G T. Then the triple Vi,vj,Vk G C d is (e, /j,/ 8) -dependent. 



Proof. This is immediate from Lemma 16,31 □ 
Claim 6.7. T as defined above is a (p,g) design with p = 5(n — 1) and g < 8/fJ,. 

Proof. By the conditions of the theorem, each V{ is contained in at least 5(n — 1) triples that 
are in T and so the bound on p holds. To prove the bound on g, fix i ^ j G [n]. If the triple 
{i,j, k} appears in T, then either Vk G circ e (vj, Vj), Vi G circ e («j, Vk) or Vj G circ e (vj, v^). In all 
three cases, we have, using Lemma l6~H that Vk G drc e >(vi, Vj) with e' = Se/fx. Since e < /i 2 /32 
we have e' < /i/4 and we can apply Lemma [631 to conclude that there could be at most 8/fj, 
such triples. □ 

Plugging the bounds obtained in the above claims and the bound \T\ < 5n 2 (which can be 
obtained by discarding some of the triples in T, as before) into Theorem [3] and into Theorem l4.ll 
completes the proof. □ 



7 Proof of Theorem [5] 

Since the proof follows the same lines as the proof of Theorem [U we will assume familiarity 
with the proof of that theorem and only give details where the proofs differ. 

We will use the following definition: 

Definition 7.1 (LCC-matrix). Let M be an nkxn matrix over C and let M±, . . . , M n be kxn 
matrices so that M is the concatenation of the blocks Mi, . . . ,M n placed on top of each other 
(so Mn contains the rows of M numbered k(£ — 1) + 1, . . . , k£). We say that M is a (k, q)-LCC 
matrix if, for each i G [n] the block Mi satisfies the following conditions: 

• Each row of Mj has support size at most q + 1 . 

• All rows in Mi have the value 1 in position i. 

• The supports of two distinct rows in Mi intersect only in position i. 

Let V = {vi, . . . , v n } C C d be a (q, 5, B, e)-stable LCC and assume w.l.o.g that d = n (that 
is, pad the vectors Vi with zeros so that we can think of them as vectors in C"). Let A be the 
n x n matrix with rows Vi. 

Claim 7.2. There exists a (k, q)-LCC matrix M with dimensions nkxn and with k = £l(5n/q) 
such that all entries of M have absolute values at most B and such that 

\\MA\\ 2 <n 2 e 2 . 

Proof. We will show how to construct the kxn block Mi of M (see Definition 17. ip row by row. 
Using the definition of stable LCC, there exists a family Qi of k = Vt(5n/q) disjoint g-tuples of 
elements of V such that, for each g-tuple J G Qi, we have dist(t>j, span B ( J)) < e. Each of these 
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(/-tuples, J, defines a row vector wj with 1 in the i'th position, £>-bounded entries in positions 
indexed by J, and zeros everywhere else in the following manner: Suppose Vi = X^e j ^j v j + e 
with \bj\ < B for all j € J and ||e|| < e. Then we define Wj to have 1 in position i and values 
— bj in positions j € J (with zeros in all other positions). Then, we have ||u;jt4|| = ||e|| < e. 
Taking all these row vectors to construct Mi we get the required bound on ||Mj4|| 2 . □ 

Let E = MA so that \\E\\ 2 < n 2 e 2 . We now construct another nk x n matrix R so 
that R T M will be diagonal dominant. R will be comprised of n blocks, R\, . . . , R n , each of 
dimensions k x n so that Ri has l's in the i'th column and zeros everywhere else. Notice that, 
the z'th row of R T M is the sum of the rows in the block Mj of M . 

Let M = R T M and E = R T E so that E = MA. An application of the Cauchy-Schwarz 
inequality shows that 

\\R T E\\ 2 < n\\E\\ 2 < n 3 e 2 . 

Observe that the diagonal elements of M are all equal to k and that the off-diagonal elements 
of M are all of absolute value at most B (since the supports of rows in Mj are disjoint except 
for the i'th coordinate). 

We proceed with analyzing the spectrum of M. Let r%,...,r n be the right singular vectors 
and <Ti, ... ,a n the corresponding singular values. If we take X = M*M then the diagonal 
elements of X are all at least K 2 > k 2 and the off diagonal elements can be bounded by 
2kB + nB 2 < 0(nB 2 ). If we define 

L = spanjrj | aj < K/2} 

we get that, using Lemma B~3l 

dim(L) < 0(n 4 B 4 /K 4 ) = 0((qB/5) 4 ). 

As in the proof of Theorem we consider the columns U\ , . . . , u n of A and obtain the 
bound 

n 

^dist(^-,L) 2 < 4\\E\\ 2 /K 2 = 0(n 3 e 2 /K 2 ). 
i=i 

This means that there is a subspace V with the same dimension as L such that 

n 

^dist(^,L') 2 < 0(n 3 e 2 /K 2 ). 
i=i 

Thus, there is a set V' C V of size n! > (l—S/2)n such that for all v' G V we have dist(u', L') 2 < 
0(n 2 e 2 /5K 2 ) = 0(q 2 e 2 /5 3 ). To finish the proof we observe that, using the definition of a stable 
LCC, for every v G V there is a g-tuple J C V with dist(uj, span B (J)) < e. Using the bound 
on the distances of elements of V to L and the bound B on the coefficients in the linear 
combinations in span B (J), we get that d\st(v,L') < e + 0(qB ■ (qe/5 15 )) = 0(q 2 Be/S 1 - 5 ). This 
completes the proof of Theorem [5l 
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